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Abstract. HoefFding proved that Kendall's and Spearman's nonparametric measures of 
correlation between two continuous random variables X and Y are each asymptotically 
normal with an asymptotic variance of the form a-^/n - provided the non-degeneracy 
condition > holds, where is a certain (always nonnegative) expression which is 
determined by the joint distribution (say fi) of X and Y. Sufficient conditions for cr^ > 
in terms of the support set (say S) of fj, are given, the same for both correlation statistics. 
One of them is that there exist a rectangle with all its vertices in S, sides parallel to the 
X and Y axes, and an interior point also in S. Another sufficient condition is that the 
Lebesgue measure of S be nonzero. 



1. Introduction 

Let {X, Y) be a random point in with a (joint) cumulative distribution function 
(c.d.f.) F and continuous marginal c.d.f.'s Fx and Fy, so that F{x,y) = P{X ^ x,Y ^ y), 
Fx{x) = P{X ^ x), and Fyiy) = P{Y ^ y) for all real x and y. Then F is continuous as 
well, since \F{x2,y2)-F{xi,yi)\ ^ \Fx{x2)-Fx{xi)\ + \FY{y2)-FY{yi)\ for ah xi, yi, a;2, 2/2 
in R. Vice versa, if F{x^ y) is continuous in x for each real y, then = F{x, y) — F{x-', y) — 
P{X = x,Y ^ y) — > P{X — x) = Fx{x) — Fx{x—), so that Fx is continuous. Similarly, 

if F{x^ y) is continuous in y for each real x, then Fy is continuous. So, the marginal c.d.f.'s 
Fx and Fy are continuous iff the joint c.d.f. F is so; in such a case, one may simply say 
that the distribution of {X, Y) is continuous. 

Let ^ = nx,Y denote the measure that is the probability distribution of {X,Y). Let 
S — Sx,Y stand for the set in R^ that is the support of /x (defined as the intersection of all 
closed sets of /.t-measure 1; then S is the smallest of all such sets, and also 5' coincides with 
the set of all a; e R^ such that fi{B^{x)) > for all £ > 0, where Bs{x) is the (say open) 
disk of radius e centered at a;) . 

The most common nonparametric statistics that measure association between X and Y 
are Spearman's rank correlation and Kendall's difference sign correlation [2j, usually 
denoted by p and r, respectively. These statistics are based on a sample of independent 
random points (^i, Yi), . . . , (X„, Yn) each having the same distribution as the random point 
(X, Y) . In his landmark paper [1] , Hoeffding proved that p and r are each asymptotically 
normal as n cx) with an asymptotic variance of the form a'^/n — provided the non- 
degeneracy condition cr^ > holds, where k is either p or r, and is a certain (always 
nonnegative) expression which is determined by the c.d.f. F. It is therefore important to 
have convenient criteria for the non-degeneracy condition cr^ > 0. 
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2. Kendall's t 

As follows from Hoeffding [1] (9.13)], ct^ = iff the function defined by the formula 

drix,y) ■.= F{x,y)- {Fx{x)+Fy{v))/2 

is constant on a set (say A) of /i- measure 1; then dr must be constant on the closure of A 
(since dr is a continuous function) and hence on S. That is, one has 

Proposition 2.1. cr^ = iff dr is constant on S. 

Even in such simple cases as the ones considered in examples given at the end of this 
section, it may be not quite immediately obvious based on Proposition 12.11 whether the 
distribution of r is asymptotically non-degenerate, in the sense that cr^ > 0. However, we 
shall give simple conditions sufficient for such non-degeneracy. 

In what follows, the term rectangle means a nonempty set of the form {xi,X2] x (yi,y2]; 
the points {xi,yi), {xi,y2), (x2,yi), (2^2,2/2) in ^ are then naturally called the vertices of 
the rectangle. 

Lemma 2.2. Suppose that S contains all tfie four vertices and also an interior point of a 
rectangle; then > 0. 

Proof. Assume, to the contrary, that cr^ — 0. Then, by Proposition [^TTJ dr is constant on S. 
On the other hand, one has 5 points (a;*,y*), (a;i,yi), (a;i,?/2), (2^2,2/1), {x2,y2) in S such 
that xi < < X2 and yi < y* < y2- So, in view of the definition of 5, 

^2 -^-j < t^{ixi,X2) X (2/1,^2)) < n{{xi,X2] X {yi,y2]) 

= dr{x2,V2) - dr{xi,y2) - dr{x2,yi) +drixi,yi) = 0, 

which is a contradiction. □ 

One simple sufficient condition is an immediate corollary of Lemma 12.21 

Corollary 2.3. If the interior of S is non-empty, then cr^ > 0. 

Working a bit harder, one can get a stronger result. 
Let Xk denote the Lebesgue measure for R*^. 

Corollary 2.4. Suppose that X2{S) > 0; then > 0. 

Proof. Since can be partitioned into countable many disjoint rectangles, one has X2{S) > 
for some rectangle R, where S :— RH S. Further, S can be approximated by the union of 
disjoint rectangles i?i , i?2, • ■ • contained in R so that S C i?„ and A2 {S) > | A2 ( IJn ^n) ) 
that is, {X2{S n Rn) — |A2(i?„)) > 0, whence A2(^ n i?„j) > |A2(i?„i) for some natural 
TO. Shifting and re-scaling if necessary, without loss of generality let us assume that R„i — 
(0, 1]2 := (0, 1] X (0, 1]. Then, introducing the set M := S Ci Rm, one has 

M CSn (0, if and A2(M) > |. 

Now, by Fubini's theorem, Ai(M^) dx = A2(M) > f , where := {y e M: {x, y) 6 M}. 
Hence, the set A :— {x £ (0,1): Ai(M^) > |} is infinite (otherwise, one would have 

Ai(M'^) ^ I for almost all x in (0,1) and thus Xi{M^) dx ^ |). Therefore, there are 
xi, Xh., X2 in A such that xi < x^ < X2. Then, for M* := M=^i n M""' n M^^ one has 
Ai(M*) > 1 — 3(1 — |) = 0, so that the set M* is infinite and thus contains some yi, y*, y2 
such that yi < y* < y2. It remains to refer to Lemma 12.21 □ 
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Corollary 2.5. Suppose that the measure /i has a nonzero absolutely continuous component 
(with respect to the Lebesgue measure X2); then a"^ > 0. 

Proof. Let / be the density of the nonzero absolutely continuous component of /i. Then 
< / dX2 = Jsf dX2 (since ^ J^^^^ f dAa s$ \ -5") = O) . Therefore, A2 (S) > 0. It 
remains to refer to Corollarv l2.4l □ 

Observe that Corollary 12.41 strictly contains Corollary 12.51 Indeed, there is a random 
point {X, Y) with a continuous c.d.f. F such that X2{S) > while fj, is singular with respect 
to A2. For example, let X be any random variable with an everywhere strictly positive 
density (with respect to Ai). Next, let Y :^ X + Q , where Q is any random variable with 
values in the set Q of all rational real numbers such that P{Y = r) > for all r G Q. 
Then the random variable Y is absolutely continuous and P{{X,Y) G 5o) = 1, where 
So := lJreQ'L('^' ^ + ■ 2; G K.}. At that, A2(S'o) = 0, so that the measure fi is singular with 
respect to A2, whereas S" = and hence A2(S') > 0. 

This example also shows that CoroUarv 12.51 does not even contain Corollarv l2.3l On the 
other hand, it is easy to see that, vice versa, Corollary 12.31 does not contain CoroUarv 12.51 
indeed, let {X, Y) be uniformly distributed on a set of the form C xC, where C is any "fat" 
Cantor subset of M (e.g. the so-called Smith- Volterra-Cantor set, http : //en. wikipedia. org/wiki/Smith- Volte 
which is a non-empty compact nowhere-dense set such that Ai(C riB^{x)) > for all x G C 
and all e > 0; then the measure /it is absolutely continuous with respect to A2, whereas 
S ^ C X C, so that the interior of S is empty. The latter example also shows that Corol- 
lary [2T4] sfriciZj/ contains Corollarv l2.3l 

Note that the 5-point condition in Lemma is not necessary for cr^ > 0. For example, 
suppose that the support of the measure fj, is the union of straight line segments Si :— 
{{x,x): ^ X ^ 1} and S2 '■— ~ x): ^ ^ x ^ 1}. Then the 5-point condition in 

Lemma l2.2l is not satisfied, and yet, the function dr is not constant on either segment Si or 
S2, so that, by Proposition 12. 11 > 0. 

However, the role of the interior point in the 5-point condition in Lemma 12.21 is in a 
certain sense indispensable. 




(a) (b) 
Figure 1. Examples with constant on S, that is, with ct^ = 0. 

For example, let the random point (X, Y) be uniformly distributed on the union S of the 
four sides of the thick-line square shown in Fig. 1 (a). (More generally, one may assume 
that the distribution is continuous, symmetric about the center of the thick-line square, and 
has S as its support.) Obviously, here there are infinitely many rectangles with all the four 
vertices in S; one of them is shown here (dashed); however, there is no 5th, interior point in 
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any such rectangle which would also belong to the support S of the distribution of {X, Y). 
Accordingly, here cr^ = by Proposition [5111 since dr{X,Y) = —j with probability 1. 

Another example is illustrated by Fig. 1 (b), where it is assumed that the support of the 
distribution of the random point {X, Y) is the union of all the four thick-line components: 
Oj V^, 'y', and , whose probability masses are jj, ji, and respectively, and at 
that the (conditional) distribution on either one of the pieces O and is continuous and 
symmetric about the center of the piece. The (conditional) distribution on either one of 
the pieces and does not matter at all, as long as it is continuous. Here too there are 
infinitely many rectangles with all the four vertices in S, but the interior of none of them has 
a nonzero /x-mass. Accordingly, cr^ = 0, since dr{X,Y) = — ^ with probability 1. However, 

any weights of the pieces and other than j^-, jj, and ^ would result in 

^2 >0. 

These examples suggest that it would be difficult, if at all possible, to find a necessary 
and sufficient condition for the non-degeneracy from which one could easily deduce such 
results as Corollarv l2.4l 

3. Spearman's p 

By Hoeffding [1] (9.27)], dp = iff a certain function dp of the form 

dp{x,y) := Fx{x)FY{y) ~ fix) - g{y) 

for some continuous functions / and g is constant on a set of /i-measure 1 or, equivalently, 
on the support S. 

So, ap — implies that — again for any 5 points (x*,y*), (xi,yi), (xi,?/2), (a;2,2/i), 
(2^2,2/2) in 5* such that xi < < X2 and j/i < < 2/2 — all the relations (|2.1I) hold with 
IJ'X €5 fiy and dp instead of /i and dr , respectively; here, as usual, /ix and fiy stand for the 
distributions of X and Y, so that p.x ® /ir is the corresponding product measure. 

Thus, obtains the following result. 

Corollary 3.1. Lemma \2.2\ holds with ap in place of a-r, and so do Corollaries \2. 9\^2. 51 
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